To alleviate the problem of structured databases' limited coverage, recent task-oriented dialogue systems incorporate external unstructured knowledge to guide the generation of system responses. However, these usually use word or sentence level similarities to detect the relevant knowledge context, which only partially capture the topical level relevance. In this paper, we examine how to better integrate topical information in knowledge grounded task-oriented dialogue and propose ``Topic-Aware Response Generation'' (TARG), an end-to-end response generation model. TARG incorporates multiple topic-aware attention mechanisms to derive the importance weighting scheme over dialogue utterances and external knowledge sources towards a better understanding of the dialogue history. Experimental results indicate that TARG achieves state-of-the-art performance in knowledge selection and response generation, outperforming previous state-of-the-art by 3.2, 3.6, and 4.2 points in EM, F1 and BLEU-4 respectively on Doc2Dial, and performing comparably with previous work on DSTC9; both being knowledge-grounded task-oriented dialogue datasets.
translated by 谷歌翻译
我们提出了Pangu-Coder,这是一种仅预读的解码器语言模型,该模型采用pangu-alpha架构进行文本到代码生成,即给定自然语言问题描述的编程语言解决方案的合成。我们使用两阶段策略训练Pangu-Coder:第一阶段采用因果语言建模(CLM)来预先培训原始编程语言数据,而第二阶段则使用因果语言建模和掩盖语言建模(MLM)的组合培训目标,专注于文本到代码生成的下游任务,并培训松散的自然语言程序定义和代码功能。最后,我们讨论了pangu-coder-ft,该pander the是通过竞争性编程问题和代码与持续集成测试的结合进行了微调的。我们评估了pangu-coder,重点是它是否生成功能上正确的程序,并证明它在参加较小的上下文窗口和较少的数据培训的同时,它比诸如Codex之类的类似大小的模型(例如Codex)实现等效性或更好的性能。
translated by 谷歌翻译
In the field of psychopathology, Ecological Momentary Assessment (EMA) methodological advancements have offered new opportunities to collect time-intensive, repeated and intra-individual measurements. This way, a large amount of data has become available, providing the means for further exploring mental disorders. Consequently, advanced machine learning (ML) methods are needed to understand data characteristics and uncover hidden and meaningful relationships regarding the underlying complex psychological processes. Among other uses, ML facilitates the identification of similar patterns in data of different individuals through clustering. This paper focuses on clustering multivariate time-series (MTS) data of individuals into several groups. Since clustering is an unsupervised problem, it is challenging to assess whether the resulting grouping is successful. Thus, we investigate different clustering methods based on different distance measures and assess them for the stability and quality of the derived clusters. These clustering steps are illustrated on a real-world EMA dataset, including 33 individuals and 15 variables. Through evaluation, the results of kernel-based clustering methods appear promising to identify meaningful groups in the data. So, efficient representations of EMA data play an important role in clustering.
translated by 谷歌翻译
边缘计算是一项有前途的技术,可以在需要瞬时数据处理的技术领域提供新功能。机器和深度学习等领域的研究人员对其应用程序进行了广泛的边缘和云计算,这主要是由于他们提供的大量计算和存储资源。目前,机器人技术也正在寻求利用这些功能,并且随着5G网络的开发,可以克服该领域的一些现有限制。在这种情况下,重要的是要知道如何利用新兴的边缘体系结构,当今存在哪些类型的边缘体系结构和平台,以及哪些可以并且应该基于每个机器人应用程序使用。一般而言,边缘平台可以以不同的方式实现和使用,尤其是因为有几个提供商提供或多或少提供的一组服务以及一些基本差异。因此,本研究针对那些从事下一代机器人系统开发的人解决了这些讨论,并将有助于理解每个边缘计算体系结构的优势和缺点,以便明智地选择适合每个应用程序的功能。
translated by 谷歌翻译
3D扫描技术的最新进展使得在数字双胞胎,远程检验和逆向工程等各种工业应用中部署了3D模型。尽管他们不断变化的性能,3D扫描仪,仍然在所获取的密集模型中引入噪音和伪影。在这项工作中,我们为密集3D扫描工业模型提出了一种快速且坚固的去噪方法。所提出的方法采用条件变化自动化器来有效地滤除面正线。在滑动补丁设置中执行培训和推理,从而减少所需培训数据和执行时间的大小。我们使用3D扫描和CAD模型进行了广泛的评估研究。结果验证了合理的去噪结果,与其他最先进的方法相比,展示了类似或更高的重建准确性。具体地,对于具有超过1E4面的3D模型,所示的管道是具有等效重建误差的方法的两倍。
translated by 谷歌翻译